Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
The future of the STEM workforce rests partly on the strength of the STEM teacher workforce to teach and nurture new generations of STEM graduates. However, the STEM teacher workforce is facing critical decline with the annual production dropping from about 31,000 a decade ago to around 20,000 in the last few years. This is concerning given the need for more STEM teachers to meet rising demands. Although production is decreasing, there are improvements in the diversity and qualifications of STEM teachers, including more female teachers and those with higher degrees in STEM fields. Investments in teacher salaries and financial support for STEM education can help address the shortage and improve the future STEM teacher workforce and STEM workforce.more » « lessFree, publicly-accessible full text available December 1, 2026
-
Achieving GPT-4o level performance in astronomy with a specialized 8B-parameter large language modelAbstract AstroSage-Llama-3.1-8B is a domain-specialized natural-language AI assistant tailored for research in astronomy, astrophysics, cosmology, and astronomical instrumentation. Trained on the complete collection of astronomy-related arXiv papers from 2007 to 2024 along with millions of synthetically-generated question-answer pairs and other astronomical literature, AstroSage-Llama-3.1-8B demonstrates remarkable proficiency on a wide range of questions. AstroSage-Llama-3.1-8B scores 80.9% on the AstroMLab-1 benchmark, greatly outperforming all models—proprietary and open-weight—in the 8-billion parameter class, and performing on par with GPT-4o. This achievement demonstrates the potential of domain specialization in AI, suggesting that focused training can yield capabilities exceeding those of much larger, general-purpose models. AstroSage-Llama-3.1-8B is freely available, enabling widespread access to advanced AI capabilities for astronomical education and research.more » « lessFree, publicly-accessible full text available December 1, 2026
-
Free, publicly-accessible full text available August 4, 2026
-
Free, publicly-accessible full text available January 28, 2026
-
AI is now a cornerstone of modern dataset analysis. In many real world applications, practitioners are concerned with controlling specific kinds of errors, rather than minimizing the overall number of errors. For example, biomedical screening assays may primarily be concerned with mitigating the number of false positives rather than false negatives. Quantifying uncertainty in AI-based predictions, and in particular those controlling specific kinds of errors, remains theoretically and practically challenging. We develop a strategy called multidimensional informed generalized hypothesis testing (MIGHT) which we prove accurately quantifies uncertainty and confidence given sufficient data, and concomitantly controls for particular error types. Our key insight was that it is possible to integrate canonical cross-validation and parametric calibration procedures within a nonparametric ensemble method. Simulations demonstrate that while typical AI based-approaches cannot be trusted to obtain the truth, MIGHT can be. We apply MIGHT to answer an open question in liquid biopsies using circulating cell-free DNA (ccfDNA) in individuals with or without cancer: Which biomarkers, or combinations thereof, can we trust? Performance estimates produced by MIGHT on ccfDNA data have coefficients of variation that are often orders of magnitude lower than other state of the art algorithms such as support vector machines, random forests, and Transformers, while often also achieving higher sensitivity. We find that combinations of variable sets often decrease rather than increase sensitivity over the optimal single variable set because some variable sets add more noise than signal. This work demonstrates the importance of quantifying uncertainty and confidence—with theoretical guarantees—for the interpretation of real-world data.more » « lessFree, publicly-accessible full text available August 26, 2026
-
Multiple case-controlled studies have shown that analyzing fragmentation patterns in plasma cell-free DNA (cfDNA) can distinguish individuals with cancer from healthy controls. However, there have been few studies that investigate various types of cfDNA fragmentomics patterns in individuals with other diseases. We therefore developed a comprehensive statistic, called fragmentation signatures, that integrates the distributions of fragment positioning, fragment length, and fragment end-motifs in cfDNA. We found that individuals with venous thromboembolism, systemic lupus erythematosus, dermatomyositis, or scleroderma have cfDNA fragmentation signatures that closely resemble those found in individuals with advanced cancers. Furthermore, these signatures were highly correlated with increases in inflammatory markers in the blood. We demonstrate that these similarities in fragmentation signatures lead to high rates of false positives in individuals with autoimmune or vascular disease when evaluated using conventional binary classification approaches for multicancer earlier detection (MCED). To address this issue, we introduced a multiclass approach for MCED that integrates fragmentation signatures with protein biomarkers and achieves improved specificity in individuals with autoimmune or vascular disease while maintaining high sensitivity. Though these data put substantial limitations on the specificity of fragmentomics-based tests for cancer diagnostics, they also offer ways to improve the interpretability of such tests. Moreover, we expect these results will lead to a better understanding of the process—most likely inflammatory—from which abnormal fragmentation signatures are derived.more » « lessFree, publicly-accessible full text available August 26, 2026
-
Two-dimensional van der Waals (vdW) magnetic materials hold promise for the development of high-density, energy-efficient spintronic devices for memory and computation. Recent breakthroughs in material discoveries and spin-orbit torque control of vdW ferromagnets have opened a path for integration of vdW magnets in commercial spintronic devices. However, a solution for field-free electric control of perpendicular magnetic anisotropy (PMA) vdW magnets at room temperatures, essential for building compact and thermally stable spintronic devices, is still missing. Here, we report a solution for the field-free, deterministic, and nonvolatile switching of a PMA vdW ferromagnet, Fe3GaTe2, above room temperature (up to 320 K). We use the unconventional out-of-plane anti-damping torque from an adjacent WTe2layer to enable such switching with a low current density of 2.23 × 106A cm−2. This study exemplifies the efficacy of low-symmetry vdW materials for spin-orbit torque control of vdW ferromagnets and provides an all-vdW solution for the next generation of scalable and energy-efficient spintronic devices.more » « less
-
Abstract External factors such as exposure to a chemical, drug, or toxicant (CDT), or conversely, the lack of certain chemicals can cause many diseases. The ability to identify such causal CDTs based on changes in the gene expression profile is extremely important in many studies. Furthermore, the ability to correctly infer CDTs that can revert the gene expression changes induced by a given disease phenotype is a crucial step in drug repurposing. We present an approach for Predicting Upstream REgulators (PURE) designed to tackle this challenge. PURE can correctly infer a CDT from the measured expression changes in a given phenotype, as well as correctly identify drugs that could revert disease-induced gene expression changes. We compared the proposed approach with four classical approaches as well as with the causal analysis used in Ingenuity Pathway Analysis (IPA) on 16 data sets (1 rat, 5 mouse, and 10 human data sets), involving 8 chemicals or drugs. We assessed the results based on the ability to correctly identify the CDT as indicated by its rank. We also considered the number of false positives, i.e. CDTs other than the correct CDT that were reported to be significant by each method. The proposed approach performed best in 11 out of the 16 experiments, reporting the correct CDT at the very top 7 times. IPA was the second best, reporting the correct CDT at the top 5 times, but was unable to identify the correct CDT at all in 5 out of the 16 experiments. The validation results showed that our approach, PURE, outperformed some of the most popular methods in the field. PURE could effectively infer the true CDTs responsible for the observed gene expression changes and could also be useful in drug repurposing applications.more » « less
-
None (Ed.)G protein-coupled receptors (GPCRs) are an ancient family of signal transducers that are both abundant and consequential in metazoan endocrinology. The evolutionary history and function of the GPCRs of the decapod superfamilies of gonadotropin-releasing hormone (GnRH) are yet to be fully elucidated. As part of which, the use of traditional phylogenetics and the recycling of a diminutive set of mis-annotated databases has proven insufficient. To address this, we have collated and revised eight existing and three novel GPCR repertoires for GnRH of decapod species. We developed a novel bioinformatic workflow that included clustering analysis to capture likely GnRH receptor-like proteins, followed by phylogenetic analysis of the seven transmembrane-spanning domains. A high degree of conservation of the sequences and topology of the domains and motifs allowed the identification of species-specific variation (up to ~70%, especially in the extracellular loops) that is thought to be influential to ligand-binding and function. Given the key functional role of the DRY motif across GPCRs, the classification of receptors based on the variation of this motif can be universally applied to resolve cryptic GPCR families, as was achieved in this work. Our results contribute to the resolution of the evolutionary history of invertebrate GnRH receptors and inform the design of bioassays in their deorphanization and functional annotation.more » « less
An official website of the United States government
